Phonetic Refraction for Speaker Recognition

Authors

  • Mary A. Kohler
  • Walter D. Andrews
  • Joseph P. Campbell
  • Jaime Hernández-Cordero
Abstract

This paper describes a newly realized high-performance speaker recognition system and examines methods for improving it. Innovative experiments early this year showed that phone strings are exceptional features for speaker recognition. The original system produced equal error rates of less than 11.5% on Switchboard-I audio files. Subsequent research indicates that the equal error rate can be nearly halved by improving the feature extraction and score fusion methods. Three changes each reduce the error rates significantly: pre-processing the speech files to remove cross-talk, improved techniques for combining scores, and gender-specific phone models.
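The equal error rate quoted in the abstract is the operating point at which the false-accept rate and the false-reject rate coincide. A minimal sketch of how it could be estimated from trial scores is given below. The function name `equal_error_rate`, the score lists, and the threshold sweep are illustrative assumptions, not the authors' evaluation code.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Estimate the equal error rate (EER) from two lists of trial scores.

    Hypothetical helper for illustration: higher scores are assumed to mean
    "more likely the claimed speaker". Returns (eer, threshold).
    """
    # Candidate thresholds: every distinct score observed in either list.
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # False reject: a true-speaker trial scored below the threshold.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # False accept: an impostor trial scored at or above the threshold.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2.0, t)
    return best[1], best[2]


if __name__ == "__main__":
    # Made-up scores, purely to exercise the function.
    targets = [2.0, 1.5, 0.2, 1.8]
    impostors = [0.1, 0.3, -0.5, -1.0]
    eer, threshold = equal_error_rate(targets, impostors)
    print(f"EER = {eer:.3f} at threshold {threshold}")
```

With a finite set of trials the two error rates rarely cross exactly, so this sketch reports their average at the closest threshold. Evaluation toolkits often interpolate between thresholds instead.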


Similar resources

Phonetic Speaker Recognition

The aim of this study is to answer two questions regarding the use of phonetic information for speaker modelling. We formulate answers for (1) what are the discriminative powers of broad phonetic classes for the task of speaker identification? (2) Are the phonetic speaker models more suitable for speaker recognition than standard models?


On the Relationship between Phone Phonetic Speaker Recog

Speaker recognition techniques have traditionally relied on purely acoustic features and models. During the last few years, however, the field of speaker recognition has started to show interest in the use of higher-level features. In particular, phonetic decodings modeled with statistical language models (n-grams) have already shown their effectiveness in several research works. However, the rel...


Phonetic, idiolectal and acoustic speaker recognition

This paper describes a text-independent speaker recognition system that achieves an equal error rate of less than 1% by combining phonetic, idiolect, and acoustic features. The phonetic system is a novel language-independent speaker recognition system based on differences among speakers in dynamic realization of phonetic features (i.e., pronunciation), rather than spectral differences in voice q...


Phonetic Speaker Id

This paper describes the exploration of text-independent speaker identification using novel approaches based on speakers’ phonetic features instead of traditional acoustic features. Different phonetic speaker identification approaches are discussed in this paper and evaluated using two speaker identification systems: one multilingual system and one single language multiple-engine system. Furthe...


Generative Acoustic-Phonemic-Speaker Model Based on Three-Way Restricted Boltzmann Machine

In this paper, we discuss a way of modeling speech signals based on a three-way restricted Boltzmann machine (3WRBM) that automatically separates phonetic-related information and speaker-related information from an observed signal. The proposed model is an energy-based probabilistic model that includes three-way potentials of three variables: acoustic features, latent phonetic features, and speak...




Publication date: 2001